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Title : Commercial Production of Chymosin in Plants 
FIELD OF THE INVENTION 

The present invention relates to improved methods for the 
recombinant production and isolation of chymosin from plants. 
5 BACKGROUND OF THE INVENTION 

Chymosin, also known as rennin, is a commercially important 
enzymatic protein, commonly used in the cheese manufacturing industry 
to coagulate milk. Traditionally chymosin has been prepared from its 
natural source, the fourth stomach of unweaned calves, although recovery 
10 from the stomachs of other mammals, such as lamb, goats etc. heretofore 
was known. More recently, primarily as a result of a decrease in calf 
production, recombinant DNA techniques have been employed to produce 
chymosin by fermentation in genetically engineered microorganisms. 
Thus a variety of bacterial and fungal hosts have been genetically modified 
15 to produce chymosin by fermentation, including for example, the bacterial 
hosts Escherichia coli, (European Patent 0 134 662 Al; Nishimori et al 
(1982) J. Biochem 91: 1085-1088.), Bacillus subtilis (US patent 5,624,819; 
5,716,807 and Parente et al (1991) FEMS 77: 243-250) and the fungal hosts 
Aspergillus sp. (European Patent 0 575 462 Bl; US patents 5,364,770 and 
20 5,863,759; Cullen et al (1987) Bio /Technology 5: 369-375., Dunn-Coleman et 
al. (1991) Bio /Technology 9: 976-981., and Tsuchiua et al. (1993) Appl. 
Microbial Biotech. 40: 327-332), Kluyvcromyces lactis (van der Berg et al 
(1990) Bio /Technology 8: 135-139 and Trichoderma ressei (Jarkki et al. 
(1989) Bio /Technology 7: 596-603; Pitts et al. (1991) Biochemical Society 
25 Transactions 19: 663-665). As well, more general expression in fungi, yeast 
and bacteria (US Patents 4,666,847) and in filamentous fungi (US patent 
5,578,463). 

The active enzyme chymosin (E.C. 3.4.23.4) is comprised of a 
polypeptide chain of a molecular mass of 35.6 kDa. However crude extracts 
30 of calf stomach mucosa in addition to active chymosin, contain two 
inactive precursor polypeptides known as pre-pro-chymosin and 
pro-chymosin. Pre-pro-chymosin contains an extra 58 amino acids at the 
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N-termLnus, whereas pro-chymosin contains an extra 42 amino acids. 
Conversion of the inactive precursor protein into enzymatically active 
chymosin requires the step-wise removal of the chymosin pre-peptide and 
pro-peptide. In vivo these activation steps take place in the calf stomach. 
5 The chymosin pre-peptide directs secretion of the polypeptide by the 
stomach cells and is removed upon secretion of the polypeptide by the 
stomach cells. The chymosin pro-peptide is subsequently removed in the 
gastric lumen, thereby activating the enzyme. The activation reaction can 
also be performed in vitro at pH values below 5. With regards to the 
10 enzyme chymosin, it should further be noted that chymosin purified from 
calf stomach is a mixture of two different polypeptides known as chymosin 
A and chymosin B. Both of these polypeptides are active and differ only 
with respect to one amino acid. The amino acid residue at position #290 is 
an aspartate residue in chymosin A and a glycine residue in chymosin B 
15 (Foltman et aL, (1977) Proc. Natl. Acad. Sci. USA 74: 2331-2324; Foltman et 
al, (1979) J. Biol. Chem. 254: 8447-8456). 

There are several disadvantages associated with the 
recombinant production of chymosin in fermentation systems. In general, 
fermentation systems require the use of large fermentation vessels that 
20 have both large space and energy requirements and consequently are 
costly. As well, the growth media require large volumes of water and may 
require special chemicals. Both of these may present environmental 
issues in the disposal of the large amounts of potentially harmful waste. 
Further, storage and shipment of raw material containing chymosin is 
25 problematic. The bacterial or fungal fermentation broth need to be 
processed immediately or refrigerated in large volumes since the enzyme 
is not stable for long periods in the broth. 

The use of plants as bioreactors for the commercial production 
of recombinant proteins is well known. For example, avidin, 
30 (J-glucuronidase and aprotinin (see patents US Patents 5,767,379, 5,804,694 
and 5,824,870) have been recombinantly expressed in corn. Further, US 
Patents 5,543,576 and 5,714,474 are broadly directed to the recombinant 
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production of enzymes in seeds and to the use of seeds or milled seeds 
comprising enzymes as a raw material in the preparation of food and feed 
products. Although US Patents 5,543,576 and 5,714,474 suggest chymosin 
as one potential enzyme that may be produced in seeds, there is no 
5 reduction to practice. These patents are further limited by the fact that in 
order to use the chymosin for the commercial production of cheese, 
chymosin would have to be purified from the seed or milled seeds. 

PCT patent application WO 92/01042 discloses the expression of 
chymosin in the leaves of transgenic tobacco and potato plants. According 
10 to the disclosure chymosin expression levels of only 0.1% to 0.5% (w/w) of 
total soluble leaf protein were attained. The methodology of WO 92/01042 
is further limited in that the production in leaves would require 
immediate extraction of the enzyme from the leaf material upon 
harvesting of the plants as the enzyme would lose activity when stored in 
15 leaves. In addition, due to the relatively high water content of leaves, 
large amounts of biomass must be processed. 

There is a need in the art to further improve methods for the 
recombinant expression of chymosin in plants. 
SUMMARY OF THE INVENTION 
20 The present invention relates to novel and improved methods 

of producing commercial levels of chymosin in transgenic plants. The 
inventors have found that chymosin when expressed in the seeds of 
transgenic plants accumulates to levels of at least 0.5% (w/w) of total seed 
protein. 

25 Accordingly, the invention provides a method for the 

production of chymosin in a plant seed comprising: 

a) introducing into a plant cell a chimeric nucleic acid 
sequence molecule comprising in the 5' to 3' direction of transcription: 

1) a first nucleic acid sequence capable of regulating 
30 transcription in said plant cell operatively linked to; 

2) a second nucleic acid sequence encoding a chymosin 
polypeptide operatively linked to; 
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3) a third nucleic acid sequence capable of terminating 
transcription in said plant cell; 
b) growing said plant cell into a mature plant capable of 
setting seed; and 

5 c) obtaining seed from the mature plant wherein said seed 

contains chymosin. 

Preferably, at least 0.5% (w/w) of the total seed protein is 

chymosin. 

The present invention also provides a method for the 
10 production of plant seeds containing at least 0.5% (w/w) chymosin in the 
total seed protein comprising: 

(a) introducing into each of at least two plant cells a chimeric 
nucleic acid sequence molecule comprising in the 5' to 3' direction of 
transcription: 

15 1) a first nucleic acid sequence capable of regulating 

transcription in said plant cell operatively linked to; 

2) a second nucleic acid sequence encoding a chymosin 
polypeptide operatively linked to; 

3) a third nucleic acid sequence capable of terminating 
20 transcription in said plant cell; 

(b) growing each plant ceil into a mature plant capable of 
setting seed; 

(c) obtaining seed from each mature plant; 

(d) detecting the levels of chymosin in the seed of each plant 
25 obtained in step (c) or in the seed of a plant generated from the seed of a 

plant obtained in step (c); and 

(e) selecting plants that contain at least 0.5% (w/w) chymosin 
in the total seed protein. 

In preferred methods of the present invention, the nucleic acid 
30 sequence capable of regulating transcription is a seed-specific promoter. In 
further preferred methods, the chimeric nucleic acid sequence additionally 
comprises a signal sequence capable of targeting the chymosin polypeptide 
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to the plant apoplast. In further preferred methods, the nucleic acid 
sequence encoding chymosin sequence is optimized for plant codon usage 
and the chymosin sequence further contains the chymosin pro-peptide or 
pre-pro-peptide or pre-peptide sequences. 
5 In a further aspect, the present invention provides plant seeds 

expressing chymosin. In a preferred embodiment of the present 
invention, the plant seeds comprise a chimeric nucleic acid sequence 
comprising in the 5' to 3' direction of transcription: 

1) a first nucleic acid sequence capable of regulating 
10 transcription in said plant cell operatively linked to; 

2) a second nucleic acid sequence encoding a chymosin 
polypeptide operatively linked to; 

3) a third nucleic acid sequence capable of terminating 
transcription in said plant cell wherein the seed contains 

15 chymosin. 

Preferably, at least 0.5% (w/w) of the total seed protein is 

chymosin. 

In another aspect the present invention provides plants capable 
of setting seed expressing chymosin. In a preferred embodiment of the 
20 invention, the plants capable of setting seed comprise a chimeric nucleic 
acid sequence comprising in the 5' to 3' direction of transcription: 

1) a first nucleic acid sequence capable of regulating 
transcription in a plant cell operatively linked to; 

2) a second nucleic acid sequence encoding a chymosin 
25 polypeptide operatively linked to; 

3) a third nucleic acid sequence capable of terminating 
transcription in said plant cell, wherein the seed contains 
chymosin. 

In yet another aspect the present invention provides a method 
30 for recovering chymosin from plant seeds. Accordingly, the present 
invention provides a method for obtaining chymosin from a plant seed 
comprising: 
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a) introducing into a plant cell a chimeric nucleic acid 
sequence molecule comprising in the 5' to 3' direction of transcription: 

1) a first nucleic acid sequence capable of regulating 
transcription kvsaid plant cell operatively linked to; 
5 2) a second nucleic acid sequence encoding a chymosin 

polypeptide operatively linked to; 

3) a third nucleic acid sequence capable of terminating 
transcription in said plant cell; 

b) growing said plant cell into a mature plant capable of 
10 setting seed; 

c) obtaining seed from the mature plant wherein said seed 
contains chymosin; and 

d) isolating said chymosin from said seed. 

In preferred embodiments, isolation of chymosin from seed in 
15 step (d) comprises: 

(i) crushing of the plant seed to obtain crushed plant seed; 

(ii) contacting the crushed plant seed or a fraction thereof 
with a protein binding resin; and 

(iii) recovering the chymosin from the protein binding resin. 
20 In further preferred embodiments upon crushing of the plant 

seed the crushed seed material is fractionated into (a) an aqueous phase 
containing substantially all of the chymosin, (b) an oil fraction, and (c) a 
fraction containing the insoluble material insoluble material. Accordingly 
step (d) more preferably comprises: 
25 (i) crushing of the plant seed to obtain crushed plant seed; 

(ii) fractionating the crushed plant seed into an oil fraction, 
aqueous fraction and a fraction comprising insoluble 
material; 

(iii) contacting the aqueous fraction with a protein binding 
30 resin; and 

(iv) recovering the chymosin from the protein binding resin. 
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In a preferred embodiment, the protein binding resin is a 
hydrophobic interaction resin. In further preferred embodiments of the 
invention, the isolation of the chymosin further comprises the 
employment of an ion exchange resin and a hydrophobic interaction resin. 

5 Other features and advantages of the present invention will 

become readily apparent from the following detailed description. It should 
be understood, however, that the detailed description and the specific 
examples while indicating preferred embodiments of the invention are 
given by way of illustration only, since various changes and modifications 

10 within the spirit and scope of the invention will become apparent to those 
skilled in the art of this detailed description. 
BRIEF DESCRIPTION OF THE DRAWINGS 

The invention will now be described in relation to the 
drawings in which: 

15 Figure 1 shows the nucleotide sequence (SEQ.ID.NO.:l) and 

corresponding amino acid sequence (SEQ.ID.NO.:2) of the open reading 
frame of a pre-pro-chymosin sequence. The "pre" sequence is indicated in 
Italics between and including amino acids 1 to 26. The "pre" sequence 
encodes a signal sequence identical to the PR-S signal sequence from 
20 tobacco sequence (Sijmons et al. (1990) Bio /technology 8: 217-221). Amino 
acids 27 to 67 inclusive are the "pro" sequence with the remaining amino 
acids encoding the mature chymosin polypeptide. 

Figure 2 shows the nucleotide sequence (SEQ.ID.NO.:3) of the 
phaseolin promoter- a pre-pro-chymosin-phaseolin terminator sequence 
25 responsible for the high levels of expression of chymosin in plant seeds. 

Figure 3 is a Western blot analysis comparing a chymosin 
standard and a protein extract of seeds from a Brassica plant expressing 
chymosin. 

Figure 4 is a bar diagram showing the expression of chymosin 
30 in flax seeds derived from independent transformed flax plants. 
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Figure 5 shows a SDS-polyacrylamide gel showing progressive 
purification of chymosin obtained from transgenic seeds of Brassica napus 
as described in example 5. 



improved methods for the production of chymosin in transgenic plants. 
The present inventors have surprisingly found that by expressing 
chymosin in the seeds of plants, chymosin accumulation levels exceeding 
0.5% (w/w) of total seed protein may be attained. These high expression 

10 levels in plant seeds allow significant commercial savings since the 
acreage of plants that needs to be grown can be limited and the amount of 
biomass that must to be subjected to extraction is reduced. The amount of 
biomass processed is further limited due to the relatively low water 
content present in plant seed. Furthermore, the expression in plants seed 

15 offers flexibility in storage and shipment of chymosin as a raw material, 
since chymosin retains its enzymatic activity upon extraction from stored 



DETAILED DESCRIPTION OF THE INVENTION 



5 



As hereinbefore mentioned, the present invention relates to 



seed. 



20 



25 



30 



Accordingly, the invention provides a method for producing 
chymosin in plant seeds comprising: 

a) introducing into a plant cell a chimeric nucleic acid 
sequence molecule comprising in the 5' to 3' direction of transcription: 

1) a first nucleic acid sequence capable of regulating 
transcription in said plant cell operatively linked to; 

2) a second nucleic acid sequence encoding a chymosin 
polypeptide operatively linked to; 

3) a third nucleic acid sequence capable of terminating 
transcription in said plant cell; 

b) growing said plant cell into a mature plant capable of 
setting seed; and 

c) obtaining said seed from said mature plant wherein the 
seed contains chymosin. 
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In a preferred embodiment, at least 0.5% (w/w) of the total seed 
protein is chymosin. More preferably at least 1% (w/w) of the total seed 
protein is chymosin, even more preferably at least 2% (w/w) of the total 
seed protein is chymosin and most preferably at least 4% (w/w) of the total 
5 seed protein is chymosin. 

As used herein the term "chymosin polypeptide" refers to all 
chymosins and includes pre-pro-chymosin and pro-chymosin 
polypeptides. The chymosin is preferably mammalian such as bovine, 
goat and sheep chymosin. 
10 The term "nucleic acid sequence encoding a chymosin 

polypeptide" refers to all nucleic acid sequence encoding chymosin and all 
nucleic acid sequences that hybridize thereto under stringent hybridization 
conditions or would hybridize thereto but for the use of synonymous 
codons. 

15 Appropriate "stringent hybridization conditions" which 

promote DNA hybridization are known to those skilled in the art, or may 
be found in Current Protocols in Molecular Biology, John Wiley & Sons, 
N.Y. (1989), 6.3.1-6.3.6. For example, the following may be employed: 6.0 x 
sodium chloride/sodium citrate (SSC) at about 45°C, followed by a wash of 

20 2.0 x SSC at 50°C. The stringency may be selected based on the conditions 
used in the wash step. For example, the salt concentration in the wash 
step can be selected from a high stringency of about 0.2 x SSC at 50°C. In 
addition, the temperature in the wash step can be at high stringency 
conditions, at about 65*C. 

25 The term "nucleic acid sequence encoding a chymosin 

polypeptide" includes nucleic sequences that encode pre-pro-chymosin 
and pro-chymosin. In addition, the nucleic acid sequences that encode 
chymosin may be linked to additional nucleic acid sequences such as those 
that encode signal peptides. 

30 In preferred embodiments of the present invention, nucleic 

acid sequences encoding bovine chymosin A or chymosin B are used (Moir 
et al. (1982) Gene 19: 127-138.; Harris et al. (1982) Nucleic Acids Res. 10: 
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2177-2187). In further preferred embodiments nucleic acid sequences 
encoding chymosin are used which have been optimized for codon usage 
in plants. The natural bovine chymosin sequence has a GC content of 56% 
with a preference for a G or C in the third position of the codon. This 
5 differs from the codon bias for cattle which has an average GC content of 
39% (Mishimori et al. (1982) J Biochem 91: 1085-1088). In a preferred 
embodiment, the codon usage of chymosin is manipulated to reflect a 
codon usage typical of seed-storage proteins found in oilseeds, for example 
using a GC content of 49% with a preference for a G or C in the third 
10 position of the codon (see Example 1). 

The invention further includes the use of nucleic acid 
sequences encoding chymosin precursor proteins that can be activated, for 
example by treating the precursor polypeptide at low pH, to exhibit 
chymosin activity. Nucleic acid sequences encoding chymosin precursor 
15 proteins that may be used in accordance with the present invention 
include naturally occurring nucleic acid sequences encoding chymosin 
precursor proteins, such as "pro-chymosin", "pre-chymosin" and 
"pre-pro-chymosin", as well as non-naturally occurring nucleic acid 
sequences encoding precursor proteins comprising chymosin and capable 
20 of activation to exhibit chymosin activity. In a preferred embodiment of 
the invention, a nucleic acid sequence encoding bovine pro-chymosin 
comprising 42 extra amino acid residues is used (Moir et al. (1982) Gene 19: 
127-138.; Harris et al. (1982) Nucleic Acids Res.10: 2177-2187). Other nucleic 
acid sequences encoding precursor proteins that may be used in accordance 
25 with the present invention include those encoding bovine 
pre-pro-chymosin comprising 58 extra amino acid residues (Moir et al. 
(1982) Gene 19: 127-138.; Harris et al. (1982) Nucleic Acids Res.10: 
2177-2187), and nucleic acid sequences encoding plant signal sequences 
capable of targeting chymosin to a preferred subcellular compartment, for 
30 example the plant apoplast, the golgi apparatus or cytoplasm. In one 
preferred embodiment, the nucleic acid sequence encoding chymosin 
comprises a nucleic acid sequence encoding the tobacco pathogenesis 
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related protein-S (PR-S) signal sequence (Sijmons et al. (1990) 
Bio/ technology 8: 217-221.) directing targeting to the plant apoplast linked 
to a nucleic acid sequence encoding a bovine pro-chymosin polypeptide 
sequence (Figure 1 and SEQ.ID.l). Other naturally occurring signal 
5 sequences that could be used in accordance with the present invention 
include for example the barley alpha amylase signal sequence (Rogers 
(1985) J. Biol. Chem. 260(6): 3731-3738) directing targeting of the chymosin 
sequence to the apoplast. The nucleic acid sequences encoding additional 
peptide sequences may be homologous as well as heterologous with 
10 respect to the nucleic acid sequence encoding the chymosin polypeptide. 
The nucleic acid sequence encoding the additional peptide sequences, such 
as the pro-peptide, pre-pro-peptide or pre-peptide, may vary in length and 
are preferably codon-optimized for use in plants. 

In embodiments of the invention involving the activation of a 
15 chymosin precursor protein, the activation reaction may be performed 
upon obtaining the plant seeds by for example treating an extracted seed 
fraction at low pH, preferably at pH values lower than 5, or the activation 
reaction may take place in planta. It is also possible to complete the 
activation reaction in a mixture comprising chymosin precursor 
20 polypeptides and enzymatically active chymosin. The chymosin precursor 
protein may be partially active or exhibiting no chymosin activity, 
however the precursor protein is typically not fully active. 

Nucleic acid sequences encoding chymosin are readily available 
or obtainable by the skilled artisan based on chymosin nucleic acid 
25 sequences and /or amino acid sequences known in the art. The bovine 
nucleic acid and amino acid sequences for chymosin A and chymosin B for 
example, are known and may be directly used in accordance with the 
present invention. As well, the complete primary structure of lamb 
preprochymosin has been deduced from cDNA (Pungercar et al. (1990) 
30 Nucleic Acids Res. 18(15): 4602). These known chymosin nucleic acid 
sequences may also be used to design and construct probes to identify 
previously undiscovered nucleic acid sequences encoding chymosin. 
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These probes may be used to isolate nucleic acid sequence encoding 
chyrnosin from for example cDNA or genomic libraries. The nucleic acid 
sequence encoding chyrnosin is preferably obtained from a mammal. 
Thus additional nucleic acid sequence chyrnosin sequences may be 
5 discovered and used in accordance with the present invention. 

The term "nucleic acid sequence" as used herein refers to a 
sequence of nucleotide or nucleoside monomers consisting of naturally 
occurring bases, sugars and intersugar (backbone) linkages. The term also 
includes modified or substituted sequences comprising non-naturally 
10 occurring monomers or portions thereof, which function similarly. The 
nucleic acid sequences of the present invention may be ribonucleic (RNA) 
or deoxyribonucleic acids (DNA) and may contain naturally occurring 
bases including adenine, guanine, cytosine, thymidine and uracil. The 
sequences may also contain modified bases such as xanthine, 
15 hypoxanthine, 2-aminoadenine, 6-methyl, 2-propyl, and other aikyl 
adenines, 5-halo uracil, 5-halo cytosine, 6-aza uracil, 6-aza cytosine and 
6-aza thymine, pseudo uracil, 4-thiouracil, 8-halo adenine, 8-amino 
adenine, 8-thiol adenine, 8-thio-alkyl adenines, 8-hydroxyl adenine and 
other 8-substituted adenines, 8-halo guanines, 8-amino guanine, 8-thiol 
20 guanine, 8-thioalkyl guanines, 8-hydroxyl guanine and other 8-substituted 
guanines, other aza and deaza uracils, thymidines, cytosines, adenines, or 
guanines, 5-trifluoromethyl uracil and 5-trifluoro cytosine. 

In accordance with the present invention, the chimeric nucleic 
acid sequences can be incorporated in a known manner in a recombinant 
25 expression vector which ensures good expression in a plant seed. 
Accordingly, the present invention includes a recombinant expression 
vector comprising a chimeric nucleic acid sequence of the present 
invention suitable for expression in a seed cell. 

The term "suitable for expression in a seed cell" means that the 
30 recombinant expression vectors contain the chimeric nucleic acids 
sequence of the invention, a regulatory region and a termination region, 
selected on the basis of the seed cell to be used for expression, which is 
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operatively linked to the nucleic acid sequence encoding the polypeptide of 
desirable amino acid composition. Operatively linked is intended to mean 
that the chimeric nucleic acid sequence encoding the polypeptide is linked 
to a regulatory sequence and termination region which allows expression 
5 in the seed cell. A typical construct consists, in the 5' to 3' direction of a 
regulatory region complete with a promoter capable of directing expression 
in a plant, a chymosin coding region and a transcription termination 
region functional in plant cells. These constructs may be prepared in 
accordance with methodology well known to those of skill in the art of 
10 molecular biology (see for example: Sambrook et al. (1990) Molecular 
Cloning, 2nd ed. Cold Spring Harbor Press). The preparation of constructs 
may involve techniques such as restriction digestion, ligation, gel 
electrophoresis, DNA sequencing and PCR. A wide variety of cloning 
vectors are available to perform the necessary cloning steps. Especially 
15 suitable for this purpose are the cloning vectors with a replication system 
that is functional in Escherichia coli such as pBR322, the pUC series 
M13mp series, pACYC184, pBluescript etc. The nucleic acid sequence may 
be introduced into these vectors and the vectors may be used to transform 
E. coli which may be grown in an appropriate medium. Plasmids may be 
20 recovered from the cells upon harvesting and lysing the cells. Final 
constructs may be introduced into plant vectors compatible with 
integration into the plant such as the Ti and Ri plasmids. 

The selection of regulatory sequences will determine the plant 
organ in which the protein is expressed and may influence the level that a 
25 gene will be transcribed. Regulatory sequences are art-recognized and are 
selected to direct expression in the plant cell. Accordingly, the term 
"regulatory sequence" includes promoters, enhancers, ribosome binding 
sites, introns and other expression elements. Examples of promoters 
include both non-seed specific, constitutive promoters such as the 35-S 
30 CaMV promoter (Rothstein et al. (1987) Gene 53: 153-161) and seed specific 
promoters such as the phaseolin promoter (Sengupta-Gopalan et aL, (1985) 
PNAS USA 82: 3320-3324) or the Arabidopsis 18 kDa oleosin promoter 
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(van Rooijen et al., (1992) Plant Mol. Biol. 18: 1177-1179). In preferred 
embodiments of the present invention, seed specific promoters are 
employed and more specifically the phaseolin promoter. Enhancers which 
may be used include the AMV leader (Jobling and Gehrke (1987) Nature 
5 325: 622-625) to increase the expression levels. It should be understood that 
the design of the expression vector may depend on such factors as the 
choice of the plant species and/or the type of polypeptide to be expressed. 

The region containing the transcriptional terminator sequence 
preferably includes from about 200 to about 1,000 nucleotide base pairs and 
10 may comprise any such sequences functional in plants, such as the 
nopaline synthase termination region (Bevan et al., (1983) Nucl. Acid. Res. 
11: 369-385), the phaseolin terminator (Van der Geest et al., (1994) Plant J. 
6(3): 413-423), the terminator for the octopine synthase gene of 
Agrobacterium tumefaciens or other similarly functioning elements. 
15 These transcription terminator regions can be obtained as described by An 
(1987) Methods in Enzym. 153: 292 or are already present in plasmids 
available from commercial sources such as ClonTech, Palo Alto, 
California. The choice of the appropriate terminator may have an effect of 
the rate of transcription. In preferred embodiments of the invention the 
20 phaseolin terminator is employed. 

The expression vectors may also contain a marker gene. 
Marker genes comprise all genes that enable distinction of transformed 
plant cells from non- transformed cells, including selectable and screenable 
marker genes. Conveniently, a marker may be a resistance marker to a 
25 herbicide, for example, glyphosate or phosphinothricin, or to an antibiotic 
such as kanamycin, G418, bleomycin, hygromycin, chloramphenicol and 
the like, which confer a trait that can be selected for by chemical means. 
Resistance markers to a herbicide when linked in close proximity to the 
chymosin gene may be used to maintain selection pressure on a 
30 population of transgenic plants for those plants that have not lost the gene 
of interest- Screenable markers may be employed to identify transformants 
through observation. They include but are not limited to the 
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beta-glucuronidase or uidA gene, a beta-lactamase gene or a green 
fluorescent protein (Niedz et al. (1995) Plant Cell Rep. 14: 403). 

A variety of techniques are available for the introduction of 
nucleic acid sequences, in particular DNA, into plant host cells. For 
example, the chimeric DNA constructs may be introduced into host cells 
obtained from dicotelydenous plants, such as tobacco, and oleagenous 
species, such as Brassica napus using standard Agrobacterium vectors by a 
transformation protocol such as described by Moloney et al. (1989) Plant 
Cell Rep. 8: 238-242 or Hinchee et al. (1988) Bio/Technol. 6: 915-922; or 
other techniques known to those skilled in the art. For example, the use of 
T-DNA for transformation of plant cells has received extensive study and 
is amply described in EP 0 120 516, Hoekema et al., (1985), Chapter V In: 
The Binary Plant Vector System Offset-drukkerij Kanters BV, 
Alblasserdam); Knauf et al. (1983), Genetic Analysis of Host Expression by 
Agrobacterium, p. 245, In: Molecular Genetics of Bacteria-Plant Interaction, 
Puhler, A. ed. Springer-Verlag, NY); and An et al., (1985) EMBO J., 4: 
277-284. Agrobacterium transformation may also be used to transform 
monocot plant species (US Patent 5,591,616). 

Conveniently, explants may be cultivated with Agrobacterium 
tumefaciens or Agrobacterium rhizogenes to allow for the transfer of the 
transcription construct in the plant host cell. Following transformation 
using Agrobacterium the plant cells are dispersed into an appropriate 
medium for selection, subsequently callus, shoots and eventually plants 
are recovered. The Agrobacterium host will harbour a plasmid 
comprising the vir genes necessary for transfer of the T-DNA to plant cells. 
For injection and electroporation (see below) disarmed Ti-plasmids 
(lacking the tumour genes, particularly the T-DNA region) may be 
introduced into the plant cell. 

The use of non-Agrobacterium techniques permits the use of 
constructs described herein to obtain transformation and expression in a 
wide variety of monocotyledenous and dicotelydenous plant species. 
These techniques are especially useful for transformation of plant species 
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that are intractable in an Agrobacterium transformation system. Other 
techniques for gene transfer include particle bombardment (Sanford, (1988) 
Trends in Biotechn. 6: 299-302), electroporation (Fromm et aL, (1985) PNAS 
USA, 82: 5824*5828; Riggs and Bates, (1986) PNAS USA 83: 5602-5606), PEG 
5 mediated DNA uptake (Potrykus et aL, (1985) Mol. Gen. Genetics., 199: 
169-177), microinjection (Reich et aL, Bio/Techn. (1986) 4:1001-1004) and 
silicone carbide whiskers (Kaeppler et al. (1990) Plant Cell Rep. 9: 415-418). 

In a specific application such as to B. napus, the host cells 
targeted to receive recombinant DNA constructs typically will be derived 
10 from cotyledonary petioles as described by Moloney et al. (1989) Plant Cell 
Rep. 8: 238-242. Other examples using commercial oil seeds include 
cotyledon transformation in soybean explants (Hinchee et al., (1988) 
Bio/Technol. 6: 915-922 and stem transformation of cotton (Umbeck et aL, 
(1987) Bio/Technol. 5: 263-266). 
15 Following transformation, the cells, for example as leaf discs, 

are grown in selective medium. Once the shoots begin to emerge, they are 
excised and placed onto rooting medium. After sufficient roots have 
formed, the plants are transferred to soil. Putative transformed plants are 
then tested for presence of a marker. Southern blotting may be performed 
20 on genomic DNA using an appropriate probe, to show integration into the 
genome of the host cell. 

Transformed plants grown in accordance with conventional 
agricultural practices, are allowed to set seed. See, for example, McCormick 
et al. (1986) Plant Cell Reports 5: 81-84. The chymosin expression level that 
25 is attained in accordance with the present invention, is generally expected 
to vary somewhat depending on the transformed plant that is assayed. As 
hereinbefore mentioned for the process to be economically attractive, a 
minimum expression level is required. The terms "commercial" and 
"commercial levels" as used herein denote an expression level wherein at 
30 least 0.5% (w/w) and more preferably more than 2% (w/w) and most 
preferably more than 4% (w/w) of total seed protein is chymosin. 
Preferably expression levels are determined using quantitative Western 
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blotting using the methodology described in detail in Example 2. 
Accordingly, typically a variety of transformed plants are screened and the 
expression level of chymosin in seed is determined. It is expected that 
typically between 5 and 50 plants may need to be screened to identify at 
5 least one plant expressing commercial levels of chymosin. Seeds obtained 
from plants expressing commercial levels of chymosin (i.e. at least 0.5% 
(w/w) of the total seed protein) are selected for further propagation. 

Accordingly, the present invention provides a method for the 
production of plant seeds containing at least 0.5% ((w/w) chymosin in the 
10 total seed protein comprising: 

(a) introducing into each of at least two plant cells a chimeric 
nucleic acid sequence molecule comprising in the 5' to 3' direction of 
transcription: 

1) a first nucleic acid sequence capable of regulating 
15 transcription in said plant cell operatively linked to; 

2) a second nucleic acid sequence encoding a chymosin 
polypeptide operatively linked to; 

3) a third nucleic acid sequence capable of terminating 
transcription in said plant cell; 

20 (b) growing each plant cell into a mature plant capable of 

setting seed; 

(c) obtaining seed from each mature plant; 

(d) detecting the levels of chymosin in the seed of each plant 
obtained in step (c) or in the seed of a plant generated from the seed of a 

25 plant obtained in step (c); and 

(e) selecting plants that contain at least 0.5% (w/w) chymosin 
in the total seed protein. 

Chymosin activity can be assayed by spectrophotometric or 
fluorometric methods or by milk-clotting assays. In the milk-clotting 
30 assay, a diluted sample is added to a milk solution so that the final 
solution contains 8% skim milk and 0.05% CaCl 2 in water. The clotting 
time or flake point is measured as the time it takes for the thin film of 
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milk to break into visible particles. The concentration of chymosin is 
determined by comparing to a linear standard plotted as clotting time in 
seconds against the chymosin concentration (Tsuchiya et al. (1993) Appl. 
Microbiol. BiotechnoL 40: 327-332). 
5 Two or more generations of plants may be grown and either 

crossed or selfed to allow identification of plants and strains with desired 
phenotypic characteristics including production of the recombinant 
polypeptide. It may be desirable to ensure homozygosity in the plants to 
assure continued inheritance of the recombinant trait. Methods for 

10 selecting homozygous plants are well known to those skilled in the art of 
plant breeding and include recurrent selfing and selection and anther and 
mircospore culture. Homozygous plants may also be obtained by 
transformation of haploid cells or tissues followed by regeneration of 
haploid plantlets subsequently converted to diploid plants by any number 

15 of known means (e.g. treatment with colchicine or other microtubule 
disrupting agents). 

The present invention also provides plant seeds expressing 
chymosin. In a preferred embodiment of the present invention the plant 
seeds comprise a chimeric nucleic acid sequence comprising in the 5' to 3' 

20 direction of transcription: 

1) a first nucleic acid sequence capable of regulating 
transcription in said plant cell operatively linked to; 

2) a second nucleic acid sequence encoding a chymosin 
polypeptide operatively linked to; 

25 3) a third nucleic acid sequence capable of terminating 

transcription in said plant cell, wherein the seed contains 
chymosin. 

In a further aspect the present invention provides plants 
capable of setting seed expressing chymosin. In a preferred embodiment of 
30 the invention, the plants capable of setting seed comprise a chimeric 
nucleic acid sequence comprising in the 5' to 3' direction of transcription: 
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1) a first nucleic acid sequence capable of regulating 
transcription in said plant cell operatively linked to; 

2) a second nucleic acid sequence encoding a chymosin 
polypeptide operatively linked to; 

5 3) a third nucleic acid sequence capable of terminating 

transcription in said plant cell, wherein the seed contains 
chymosin. 

The methods disclosed in the present invention can be used 
over a broad range of plant species. Particularly preferred plant cells 

10 employed in accordance with the present invention include cells from the 
following plants: soybean (Glycine max), rapeseed (Brassica napus, Brassica 
campestris), sunflower (Helianthus annuus), cotton (Gossypium 
hirsutum), corn (Zea mays), tobacco (Nicotiana tobacum), alfalafa 
(Medicago sativa), wheat (Triticum sp.), barley (Hordeum vulgare), oats 

15 (Avena saliva L.), sorghum (Sorghum bicolor), Arabidopsis thaliana, 
potato (Solatium sp.), flax/linseed (Linum usitatissimum), safflower 
(Carthamus tinctorius), oil palm (Eleais guineeis), groundnut (Arachis 
hypogaea), Brazil nut (Bertholletia excelsa) coconut (Cocus nucifera), castor 
(Ricinus communis), coriander (Coriandrum sativum), squash (Cucurbita 

20 maxima), jojoba (Simmondsia chinensis) and rice (Oryza sativa). 

The invention also provides a method for recovering 
chymosin from a plant seed comprising: 

a) introducing into a plant cell a chimeric nucleic acid 
sequence molecule comprising in the 5' to 3' direction of transcription: 

25 1) a first nucleic acid sequence capable of regulating 

transcription in said plant cell operatively linked to; 

2) a second nucleic acid sequence encoding a chymosin 
polypeptide operatively linked to; 

3) a third nucleic acid sequence capable of terminating 
30 transcription in said plant cell; 

b) growing said plant cell into a mature plant capable of setting 

seed; 



WO 01/14571 PCT/CA00/00975 

-20- 

c) obtaining seed from the mature plant wherein said seed 
contains chymosin; and 

d) isolating said chymosin from said seed. 

In preferred embodiments, isolation of chymosin from seed 

5 comprises: 

i) crushing the plant seed to obtain crushed plant seed; 

ii) contacting the crushed plant seed or a fraction thereof 
with a protein binding resin; and 

iii) recovering chymosin from the protein binding resin. 

10 The term "crushing" as used herein refers to any process or 

methodology to comminute seed and includes mechanical pressing, 
grinding, crushing processes and the like. Preferably the seeds are ground 
using a mill such as for example a colloid mill, a disk mill, a pin mill, an 
orbital mill, an IKA mill, a homogenizer or similar equipment. The 
15 selection of the crushing equipment depends inter alia on the throughput 
requirements and on the seed source. Typically the crushing conditions 
selected result in the breakage of individual seed cells. It is of importance 
however that the chymosin polypeptide remains intact. Crushing 
conditions that would substantially inactivate the enzyme are undesirable 
20 in the practice of the present invention. The crushing process practiced in 
accordance with the present invention permits the recovery of a crushed 
plants seeds comprising chymosin. 

The crushing process may be carried out using dry seed. 
Preferably however the seeds are crushed in the presence of water or a 
25 buffer. Prior to, during or after the crushing process, additional water or a 
buffer may be employed to dilute the seed extract. Preferably the crushed 
seed fraction obtained is between 2 and 100 fold diluted relative to the 
original seed volume. Furthermore the salt concentration may be 
adjusted by the addition of extraneous salts or salt solutions to the crushed 
30 seeds. Accordingly, preferably the extraneous salt concentration of the 
crushed seed that is obtained is preferably between approximately 0.1M and 
2M. Suitable salts to adjust the salt concentration in accordance with the 
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present invention include sulfate salts for example sodium sulfate, 
magnesium sulfate, and ammonium sulfate; phosphate salts, for example 
sodium phosphate, magnesium phosphate and ammonium phosphate; 
chloride salts, for example sodium chloride and calcium chloride; and 
5 mixtures thereof. A preferred salt used in accordance with the present 
invention is sodium chloride. 

Upon crushing of the seed it is generally preferable to prepare 
an aqueous fraction of the crushed plant seeds by the removal of the 
insoluble material and the oil fraction of the seed. The insoluble material 
10 is substantially insoluble or in an insolublized association with insoluble 
material produced upon crushing of the plant seed material. The 
insoluble material is either produced in the plant seed or may be associated 
with the plant seed in the form of insoluble aggregates including, seed 
hulls, fibrous material, carbohydrates or external contaminants such as soil 
15 particles and the like. The process permits the separation of soluble seed 
material from insoluble seed material. Any suitable methodology may use 
be accomplished using any methodology that allows the separation of the 
seed insoluble material from the soluble seed constituents, including for 
example gravitation based methods such as for example centrifugation or 
20 size exclusion based methods such as filtration. In a preferred 
embodiment of the present invention centrifugation is used. 
Centrifugation equipment that may be used in accordance with the present 
invention includes a tubular bowl centrifuge, a decantation centrifuge, a 
hydrocycione, a disk stack centrifuge, and the like. 
25 Removal of the oil fraction is particularly desirable when 

chymosin is produced in seeds comprising a relatively high oil content 
such as rapeseed, flax, sunflower seed and the like. Any suitable 
methodology may be used that allows the separation of the oil fraction 
from the aqueous fraction of the seed, including for example gravitation 
30 based methods such as for example centrifugation or size exclusion based 
methods such as filtration. In a preferred embodiment of the present 
invention centrifugation is used. Centrifugation equipment that may be 
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used in accordance with the present invention includes a tubular bowl 
centrifuge, a decantation centrifuge, a hydrocyclone, a disk stack centrifuge, 
and the like. 

Generally the solids are removed prior to the oil fraction, 
5 however in other embodiments of the invention the removal of insoluble 
seed constituents and the oil fraction is accomplished concomitantly using 
a gravity based method such as a 3-phase tubular bowl centrifuge or 
decanter or a size-exclusion based separation method. 

In a further preferred embodiment selective precipitation of 

10 the crushed plant seed extract or fraction thereof may be performed prior 
to contacting the plant seed extract or fraction thereof with the protein 
binding resin. This selective precipitation step is preferably accomplished 
by selecting any conditions that allow the precipitation of at least 50% 
(w/w) of the endogenous seed proteins while substantially all chymosin 

15 remains soluble. With the term "substantially all" it is meant that at least 
approximately 75% (w/w) of all chymosin remains soluble. In a more 
preferred embodiment at least 85% (w/w) of all chymosin remains soluble. 
In the most preferred embodiment at least approximately 90% (w/w) of all 
chymosin remains soluble. In preferred embodiments of the present 

20 invention precipitation is accomplished by adjusting the pH of the crushed 
seed extract. The pH of the crushed seed is preferably adjusted to a pH of 
less than approximately 5.5. More preferably the pH is adjusted to a pH of 
between approximately 1.5 and 3.5. Most preferably the pH is adjusted to a 
pH of approximately 2.0. Any suitable acid my be used to adjust the pH, 

25 such as hydrochloric acid, sulfuric acid, phosphoric acid and the like 
preferably having a pH of less than 2. The precipitation step may take 
place concommitantly with the crushing step. In preferred embodiments, 
the precipitation step is performed subsequent to the seed-crushing step. 
Furthermore the precipitation may be performed prior to or subsequent to 

30 either the removal of the insoluble material or removal of the oil fraction. 
It is preferred however to remove the insoluble material and the oil 
fraction prior to selective precipitation. 
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The term "protein binding resin" means any resin that is 
capable of binding to proteins, in particular chymosin. In a preferred 
embodiment, the protein binding resin is a hydrophobic interaction resin. 

The present inventors have found that a hydrophobic 
5 interaction resin is particularly useful in isolating chymosin from plant 
seeds. A "hydrophobic interaction resin" refers to any protein compatible 
resin capable of differentially binding proteins present in a mixture of 
proteins, said differential binding occurring as a result of differences in 
hydrophobic characteristics of the proteins present in the mixture. 

10 Hydrophobic interaction resins are generally art-recognized and include 
for example sepharose resins having functional groups such as alkyl 
groups (e.g. butyl-sepharose, octyl-sepharose) and phenyl groups (e.g. 
phenyl-sepharose) and superose resins having functional groups such as 
alkyl groups and phenyl groups. The hydrophobic interaction resin may 

15 be used batch-wise or prepared for column chromatography. 

In the practice of the present invention the crushed seed extract 
or a fraction thereof comprising chymosin is contacted with the 
hydrophobic interaction resin under conditions that, will permit chymosin 
to bind to the hydrophobic interaction resin. Preferred binding conditions 

20 in accordance with the present invention are conditions of high ionic 
strength, for example 1M to 2M salt concentrations, e.g. 1.5M ammonium 
sulphate. Other salts that may be used in accordance with the present 
invention include sulfate salts for example magnesium sulfate; phosphate 
salts, for example sodium phosphate, magnesium phosphate and 

25 ammonium phosphate; chloride salts, for example sodium chloride and 

> 

calcium chloride; and mixtures thereof. Once binding has been 
accomplished conditions are altered so that the bound substances are 
eluted differentially thus allowing the recovery of chymosin from the 
hydrophobic interaction column. Preferably the ionic strength is altered 
30 to accomplish elution, for example the ionic strength is reduced from 1.5 
M to 0.5 M. The changes in conditions may be performed stepwise or 
gradually. Other elution methodologies that may be employed include 
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reducing the eluent polarity for example using a glycol gradient up to 50%, 
adding chaotropic species such as urea, guanidine hydrochloride; the 
addition of detergents; changing pH or temperature. 

In further preferred embodiments, chymosin is additionally 
5 purified by employing an ion exchange resin. An "ion exchange resin" 
refers to any protein compatible resinous material which is capable of 
binding charged compounds. Ion exchange columns are art recognized 
and include anion and cation exchange resins. These resins may be 

* 

employed in a batch fashion or as a column. Preferred cation exchange 
10 columns for use in the present invention, include for example Pharmacia 
SP-Spehadex, Indion SP-2, IBF SP-Triacryl, IBF SP-Spherodex and the like. 
Preferred anion exchange resins in this regard are DEAE cellulose, IBF Q 
Spherodex, Pharmacia Q-Sephadex, Indion Q-2, IBF Q-Trisacryl and the 
like. In the practice of the present invention the aqueous solution of 
15 comprising chymosin is contacted with the ion-exchange resin under 
conditions at which the chymosin will bind to the resin. Whether 
chymosin binds to the resin depends on the pH of the aqueous solution, 
i.e. whether the pH is below or above the isoelectric point of chymosin 
(approximately 4.6). Accordingly, contacting the aqueous solution 
20 comprising chymosin under conditions at which chymosin will bind to 
the column refers to adjusting the pH of the solution above or below its 
isoelectric point so that it will bind to the selected resin. Binding of 
chymosin to the resin further depends on the ionic strength. Accordingly, 
the salt concentration may vary, for example a concentration of less than 
25 250mM NaCl may be used. In order to elute chymosin of the resin 
conditions are selected which permit the elution of chymosin from the 
resin, preferably the ion concentration is adjusted to elute the chymosin of 
the resin. For example the salt concentration may be adjusted to a 
concentration of 2M NaCl. The pH and salt concentration of the chymosin 
30 preparation thus recovered may be adjusted as desired. The ion exchange 
resin step may be employed either prior or after the hydrophobic 
interaction step. 
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Optionally the chymosin preparation may be concentrated 
using for example ultrafiltration or treated for longer-term preservation 
using any suitable preservation methodology. For example the chymosin 
preparation may be sterilized using methodologies such as filtration or 
5 ultrafiltration. 

Optionally the chymosin preparation may be concentrated 
using for example ultrafiltration or treated for longer-term preservation 
using any suitable preservation methodology. For example the chymosin 
preparation may be sterilized using methodologies such as filtration or 
10 ultrafiltration. 

The following non-limiting examples are illustrative of the 
present invention: 

EXAMPLES 

EXAMPLE 1 

15 Construction of a plant transformation vector comprising of a ch imeric 
nucleic acid sequence containing pre-pro-chymosin . 

A pro-chymosin gene was re-synthesized from the bovine 
pro-chymosin to reflect the plant-preferred codons (See Figure 1 and 
SEQ.ID.NOS.: 1 and 2). Amino acids 27 to 67 are the pro-peptide sequence 

20 and amino acids 68 to 390 are the mature chymosin polypeptide. A PR-S 
signal sequence was attached to the 5' end of the pro-chymosin gene by 
PCR fusion. The PRS sequence includes amino acids 1 to 26 in Figure 1. 
The pre-pro-chymosin DNA fragment was fused in between a phaseolin 
promoter and the phaseolin terminator derived from the common bean 

25 Phaseolus vulgaris Slightom et al (1983) Proc. Natl Acad Sc USA 80: 
1897-1901). A complete sequence of the phaseolin 

promoter-preprochymosin-phaseolin terminator insert responsible for the 
expression of chymosin in plant seeds is shown in Figure 2 and 
SEQ.ID.NO.:3. This insert was cloned into the Pstl-Kpnl sites of vector 

30 pSBS2004 and pSBS3000 and resulted in plasmids pSBS2151 and pSBS2165 
respectively. pSBS2004 is a derivative from the Agrobacterium binary 
plasmid pCGN1559 (MacBride and Summerfield, 1990, Plant Molec. Biol. 
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14 269-276) in which, the CAMV 35S prornoter-neomycin 
phosphotransferase gene- tumor morphology large locus 3' antibiotic 
selection cassette of pCGN1559 was replaced with parsley ubiquitin 
promoter-phosphinothricin acy transferase gene-parsley ubiquitin 
5 termination sequence to confer resistance to the herbicide glufosinate 
ammonium. pSBS3000 is a derivative from the Agrobacterium binary 
plasmid pPZP221 (Hajdukiewicz et al., 1994, Plant Molec. Biol. 25: 989-994). 
In pSBS3000, the CaMV35S promoter-gentamycin resistance gene-CAMV 
35S terminator of pPZP221 was replaced with parsley ubiquitin 
10 promoter-phosphinothricin acetyl transferase gene-parsley ubiquitin 
termination sequence to confer resistance to the herbicide glufosinate 
ammonium. 
EXAMPLE 2 

Generation of chymosin-expressing transgenic plants 

15 Plasmids pSBS2151 and pSBS2165 were electroporated into 

Agrobacterium strain EHA101 (Hood, et al (1986) J Bacteriol 144: 732-743). 
Agrobacterium strain EHA101 (pSBS2151) was used to transform Brassica 
napus. The procedure for the transformation of Brassica has been 
essentially outlined in Moloney et al. (1989) Plant Cell Reports 8: 238-242, 

20 except phophinothricin, at a concentration of 1 to 2 mg/L, was used as the 
selectable agent, Agrobacterium strain EHA101 (pSBS2165) was used to 
transform flax cv Mc Gregor. Flax transformation was performed 
essentially as described in Jordan and McHughen (1988) Plant cell reports 7: 
281-284, except transgenic shoots were selected on 10 uM 

25 L-phosphinothricine instead of kanamycin. 
EXAMPLE 3 

Expression levels of chymosin in Brassica 

Physical characteristics of Brassica napus seed extracted 
chymosin were compared relative to commercially available bovine 
30 chymosin. The molecular weight of the two chymosin proteins was 
determined by gel electrophoresis on a 12% poly-acrylamide gel and 
Western blot analysis using a polyclonal rabbit antibody as shown in 
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Figure 3. Specified concentrations were loaded onto a 12% poly-acrylamide 
gel and transferred to a membrane. The membrane was probed with a 
polyclonal antibody raised against commercial available bovine chymosin 
and visualized using alkaline phosphatase. This polyclonal antibody is 
5 immunologically reactive with several bands in the transgenic seed 
extract. Bands of the same electroforetic mobility are found in the 
commercial bovine chymosin extract. This suggests that the majority of 
the pre-pro-chymosin in the seed extract has matured into chymosin. The 
lower molecular weight bands likely result from proteolytic digestion of 
10 the mature protein and the minor higher molecular weight bands could 
correspond to altered processed forms of either preprochymosin or 
prochymosin. The protein levels for chymosin in one of the Brassica 
plants analyzed is shown in Figure 3. Seeds were ground in water to make 
a seed extract and the protein concentration was determined as described 
15 in Bradford (1976) Anal. Biochem. 72: 248-254. Different concentrations of 
the same seed extract were electrophoresed on a gel along with a bovine 
derived chymosin standard loaded with known concentrations. Western 
blot analysis was performed with a polyclonal rabbit antibody and 
visualized using alkaline phosphatase. Quantitative densitometry was 
20 used to correlate the density of the 35.6 kDa band to the concentration of 
the protein by comparison with a standard curve derived from known 
concentrations of chymosin. Table 1 is a compilation of the data for the 
amount of chymosin in the identical seed extract of differing 
concentrations and resulting percent of expression. The slightly different 
25 levels reflect a standard error. Note that no data is provided for 4 pg and 8 
pg of seed extract as the results exceeded the saturation range of the 
densitometer. 

The biological activity of the plant (Brassica) derived chymosin 
was determined through the use of milk-clotting assays. In the 
30 milk-clotting assay, a diluted seed extract sample is added to a clotting 
substrate as described in (Tsuchiya et al. (1993) Appl. Microbiol. 
Biotechnol. 40: 327-332). Transgenic Brassica seeds had the ability to clot 
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milk whereas, seeds that were not transformed with the pro-chymosin 
gene were unable to clot milk. 
EXAMPLE 4 

Expression levels of chymosin in flax (Linutn usitatissitnum) 
5 Transgenic flax plants containing the preprochymosin gene 

were analyzed for the presence of biologically active chymosin. The 
biological activity of the plant derived chymosin was determined through 
the use of milk-clotting assays. In the milk-clotting assay, a diluted flax 
seed extract sample is added to a clotting substrate as described in 
10 (Tsuchiya et aL (1993) Appl. Microbiol. Biotechnol. 40: 327-332). The 
clotting time or flake point is measured as the time it takes for the thin 
film of milk to break into visible particles. The concentration of chymosin 
in the seed extract is determined by comparing it to a linear standard curve 
plotted as clotting time in seconds against the chymosin concentration 
15 (Tsuchiya et al. (1993) Appl. Microbiol. Biotechnol. 40: 327-332). The 
chymosin concentration was first determined as a weight percentage of 
seed weight (=W%). The percentage chymosin as a percentage of total seed 
protein was calculated by using the formula (W/ percentage protein in dry 
seed) X 100. For flax the total amount of protein as a percentage of seed 
20 weight equals approximately 20 % (Gill, 1987, Linseed, Indian Council of 
Agricultural Research Publication). Wx5 equals the expression level of 
chymosin as a percentage of total seed protein. Figure 4 shows the 
expression levels of chymosin in transgenic flax seeds as a percentage of 
total protein for selected transform ants. 
25 EXAMPLE 5 

Purification of chymosin from transgenic Brassica napus seed 

This example describes the laboratory-scale purification of 
chymosin from transgenic seed produced as described in example 2. Forty 
grams of transgenic Brassica napus seed containing recombinant chymosin 
30 was combined with 400 mis of a solution containing 250 mM NaCl. The 
mixture was ground using a polytron to produce a slurry releasing the 
chymosin into solution. This slurry was then centrifuged at 
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approximately 10,000 x g to separate it into three phases, a solid pellet 
phase of insoluble material, an upper phase of seed oil bodies and 
associated proteins and a middle aqueous phase containing the chymosin, 
soluble seed proteins and other soluble seed components. Following 

5 centrifugation, the aqueous phase was removed and clarified by filtration. 
The clarified extract was adjusted to a pH of 2.0 by addition of sulfuric acid 
and allowed to sit for several minutes and then readjusted to pH 5.6 with 
aqueous ammonia. The extract was then centrifuged at 10,000 x g to 
remove precipitated proteins and the soluble supernatant phase recovered. 

0 The low pH-treated extract was diluted with water to a conductivity of 
approximately 9.5 mmohs and then loaded on to an anion exchange 
column containing approximately 30 mis of DEAE-cellulose previously 
equilibrated with 0.5% sodium benzoate, 0.379% NaCl, pH 5.6. After 
loading, the column was washed with approximately 200 mis of 0.5% 

5 sodium benzoate, 0.379% NaCl, pH 5.6 and then eluted with 110 mis of 
0.5% sodium benzoate, 10% NaCl, pH 5.6. The eluate from the anion 
exchange step was loaded on to a gel filtration column containing G25 
sephadex (Amersham-Pharmacia) equilibrated with 25 mM sodium 
phosphate, 1 M ammonium sulfate, pH 5.6. Fifty mis of the eluate from 

10 this column was passed through a 0.22 um filter and then loaded on to a 
hydrophobic interaction column containing 4.6 mis of butyl sepharose 
(Fast Flow, Amersham-Pharmacia) previously equilibrated with 25 mM 
sodium phosphate, 1 M ammonium sulfate, pH 5.6. After loading, the 
column was washed with 20 mis of 25 mM sodium phosphate, 1 M 

'5 ammonium sulfate, pH 5.6 followed by 75 mis of 25 mM sodium 
phosphate, 0.55 M ammonium sulfate, pH 5.6. Purified chymosin was 
eluted from the column with 24 mis of 25 mM sodium phosphate, 0.1 M 
ammonium sulfate, pH 5.6. Figure 5 shows a SDS-polyacrylamide gel 
showing progressive purification of chymosin obtained from transgenic 

50 seeds of Brassica napus as described above. Lane 1, aqueous phase from 
total seed extract; lane 2 pH-treated extract; lane 3, DEAE-cellulose eluate; 
lane 4, purified chymosin eluted from butyl sepharose. 
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While the present invention has been described with reference 
to what are presently considered to be the preferred examples, it is to be 
understood that the invention is not limited to the disclosed examples. To 
the contrary, the invention is intended to cover various modifications and 
5 equivalent arrangements included within the spirit and scope of the 
appended claims. 

All publications, patents and patent applications are herein 
incorporated by reference in their entirety to the same extent as if each 
individual publication, patent or patent application was specifically and 
10 individually indicated to be incorporated by reference in its entirety. 
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TABLE 1 



mg of seed extract 


0.5 


1.0 


2.0 


ng of pro-chymosin in 
seed extract 


21 


47 


88 


level of expression (% of 
protein) 


4.2 


4.7 


4.4 


Average level of 
expression (% of protein ) 




4.43 
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We Claim : 

1. A method for the production of chymosin in a plant seed 
comprising: 

a) introducing into a plant cell a chimeric nucleic acid 
5 sequence molecule comprising in the 5' to 3' direction of transcription: 

1) a first nucleic acid sequence capable of regulating 
transcription in said plant cell operatively linked to; 

2) a second nucleic acid sequence encoding a chymosin 
polypeptide operatively linked to; 

10 3) a third nucleic acid sequence capable of terminating 

transcription in said plant cell; 

b) growing said plant cell into a mature plant capable of setting 

seed; and 

c) obtaining seed from the mature plant wherein said seed 
15 contains chymosin. 

2. The method according to claim 1 wherein said first nucleic 
sequence capable of regulating transcription in said plant cell is a 
seed-specific promoter. 

3. The method according to claim 3 wherein said seed-specific 
20 promoter is a phaseolin promoter. 

4. A method according to any one of claims 1 to 3 wherein at least 
0.5% (w/w) of the total seed protein is chymosin. 

5. The method according to any one of claims 1 to 4 wherein the 
second nucleic acid sequence encoding a chymosin polypeptide comprises 

25 a nucleic acid sequence encoding a chymosin pro-peptide, a nucleic acid 
sequence encoding a chymosin pre-peptide or a nucleic acid sequence 
encoding chymosin pre-pro-peptide. 
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6. The method according to claim 5 wherein the second nucleic 
acid sequence encoding a chymosin polypeptide further comprises a 
nucleic acid sequence encoding a plant signal sequence. 

7. The method according to any one of claims 1 to 6 wherein the 
5 second nucleic acid sequence encoding a chymosin polypeptide further 

comprises a nucleic acid sequence encoding a plant signal sequence. 

8. The method according to claim 7 wherein the plant signal 
sequence is a tobacco PR-S sequence. 

9. The method according to claim 8 wherein the nucleic acid 
10 sequence encoding chymosin linked to a PR-S signal sequence comprises a 

nucleic acid sequence as in SEQ.ID.NO.:l. 

10. The method according to any one of claims 1 to 9 wherein said 
third nucleic acid sequence is a phaseolin terminator. 

11. The method according to any one of claims 1 to 9 wherein the 
15 chymosin is a mammalian chymosin obtainable from a bovine, sheep or 

goat source. 

12. The method according to claim 5 wherein codon usage for said 
nucleic acid sequence encoding chymosin, chymosin pro-peptide, 
chymosin pre-peptide and chymosin pre-pro-peptide has been optimized 

20 for use in plants. 

13. The method according to any one of claims 1 to 12 wherein said 
plant is selected from the group of plants consisting of soybean (Glycine 
max), rapeseed (Brassica napus, Brassica campestris), sunflower 
(Helianthus annuus), cotton (Gossypium hirsutum), corn (Zea mays), 
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tobacco (Nicotiana tobacum), alfalafa (Medicago sativa), wheat (Triticum 
sp.), barley (Hordeum vulgare), oats (Avena sativa L.), sorghum (Sorghum 
bicolor), Arabidopsis thaliana, potato (Solarium sp.), flax/linseed (Linum 
usitatissimum), safflower (Carthamus tinctorius), oil palm (Eleais 
5 guineeis), groundnut (Arachis hypogaea), Brazil nut (Bertholletia excelsa) 
coconut (Cocus nucifera), castor (Ricinus communis), coriander 
(Coriandrum sativum), squash (Cucurbita maxima), jojoba (Simmondsia 
chinensis) and rice (Oryza sativa). 

14. The method according to any one of claims 1 to 13 wherein at 
10 least 1% (w/w) of said total seed protein is chymosin. 

15. The method according to any one of claims 1 to 13 wherein at 
least 2% (w/w) of said total seed protein is chymosin. 

16. The method according to any one of claims 1 to 13 wherein at 
least 4% (w/w) of said total seed protein is chymosin. 

15 17. A method for the production of plant seeds containing at least 

0.5% (w/w) chymosin in the total seed protein comprising: 

(a) introducing into each of at least two plant cells a chimeric 
nucleic acid sequence molecule comprising in the 5' to 3' direction of 
transcription: 

20 1) a first nucleic acid sequence capable of regulating 

transcription in said plant cell operatively linked to; 

2) a second nucleic acid sequence encoding a chymosin 
polypeptide operatively linked to; 

3) a third nucleic acid sequence capable of terminating 
25 transcription in said plant cell; 

(b) growing each plant cell into a mature plant capable of 
setting seed; 

(c) obtaining seed from each mature plant; 
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(d) detecting the levels of chymosin in the seed of each plant 
obtained in step (c) or in the seed of a plant generated from the seed of a 
plant obtained in step (c); and 

(e) selecting plants that contain at least 0.5% (w/w) chymosin 
5 in the total seed protein. 

18. A method according to any one of claims 1 to 16 further 
comprising (d) isolating said chymosin from said seed. 

19. A method according to claim 18 wherein (d) isolating said 
chymosin from said seed comprises: 

10 (i) crushing the plant seed to obtain crushed plant seed; 

(ii) contacting the crushed plant seed or a fraction thereof 
with a protein binding resin; and 

(iii) recovering chymosin from the protein binding resin. 

20 - A method according to claim 18 wherein (d) isolating said 

15 chymosin from said seed comprises: 

(i) crushing of the plant seed to obtain crushed plant seed; 

(ii) fractionating the crushed plant seed into an oil fraction, 
aqueous fraction and a fraction comprising insoluble 
material; 

20 (iii) contacting the aqueous fraction with a protein binding 

resin; and 

(iv) recovering the chymosin from the protein binding resin. 

21. A method according to claim 19 wherein said protein binding 

resin is a hydrophobic interaction resin. 



22. A method according to claim 20 wherein said protein binding 

resin is a hydrophobic interaction resin. 
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23 - A method according to claim 21 or 22 further comprising using 

an ion exchange resin to further purify the chymosin, 

24. Plant seed comprising at least 0.5% (w/w) heterologously 

expressed chymosin. 

5 25. Plant seed prepared according to the method of any one of 

claims 1 to 23. 

26. A plant capable of setting seed comprising at least 0.5% (w/w) 

of heterologously expressed chymosin. 

27 - A plant capable of setting seed prepared according to the 

10 method of any one of claims 1 to 23 . 
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FIGURE 1 

1 ATG AAC TTC CTT AAG TCT TTC CCT TTC TAG GCT TTC CTT TGT TTC GGT CAA TAC TTC GTT 60 

1MNFLKSFPFYAFLCFGQYFV 20 

60 GCT GTT ACT CAC GCT GCT GAG ATC ACC CGC ATT CCT CTC TAC AAA GGT AAG TCT CTC CGT 120 

21 AVTHAAEITRI PLYKGKS LR 40 

121 AAG GCG CTG AAG GAA CAT GGA CTT CTA GAA GAC TTC TTG CAG AAA CAA CAG TAT GGC ATC 180 

41KALKEHGLLEDFLQKQQYGI 60 

181 AGC AGC AAG TAC TCC GGC TTC GGT GAA GTT GCT AGC GTG CCA CTT ACC AAC TAC CTT GAT 240 

61SSKYSGFGEVASVPLTNYLD 80 

241 AGT CAA TAC TTT GGG AAG ATC TAC CTC GGA ACC CCG CCT CAA GAG TTC ACC GTT CTC TTT 300 

81 S Q Y FGKIYLGTP PQEFTVLF 100 

301 GAT ACT GGT TCC TCT GAC TTC TGG GTT CCC TCT ATC TAC TGC AAG AGC AAT GCC TGC AAG 3 60 

101 DTGSSDFWVPSI YCKSNACK 120 

361 AAC CAC CAA AGA TTC GAT CCG AGA AAG TCG TCC ACC TTC CAG AAC TTA GGC AAA CCC TTG 42 0 

121 NHQRFDPRKSSTFQNLGKPL 140 

42 0 TCT ATA CAC TAC GGT ACA GGT AGC ATG CAA GGA ATC TTA GGC TAT GAT ACC GTC ACT GTC 48 0 

141 SIHYGTGSMQGI LGYDTVTV 160 

481 TCC AAC ATT GTG GAC ATT CAA CAG ACA GTA GGA CTT AGC ACC CAA GAA CCA GGT GAT GTC 540 

161 SNIVDIQQTVGLSTQEPGDV 180 

541 TTC ACC TAT GCA GAA TTC GAT GGC ATC CTT GGT ATG GCA TAC CCA TCG CTC GCG TCA GAG 600 

181 FTYAEFDGILGMAYPSLASE 200 

601 TAC TCG ATA CCT GTG TTT GAC AAC ATG ATG AAC CGA CAC CTA GTA GCT CAA GAC TTG TTC 660 

201 YSI PVFDNMMNRHLVAQDLF 220 

661 TCG GTT TAC ATG GAC AGG AAT GGC CAG GAG AGC ATG CTC ACG CTT GGA GCT ATT GAT CCA 72 0 

221 SVYMDRNGQESMLTLGAI DP 240 

721 TCC TAC TAC ACA GGA TCT CTT CAC TGG GTT CCA GTC ACT GTG CAG CAG TAC TGG CAA TTC 7 80 

241 SYYTGS LHWVPVTVQQYWQF 260 

7 81 ACT GTG GAC AGT GTC ACC ATC AGC GGT GTG GTT GTT GCA TGT GAA GGT GGA TGT CAA GCT 840 

261 TVDSVTISGVVVACEGGCQA 280 

841 ATC TTG GAT ACC GGT ACG TCC AAG CTG GTC GGA CCT AGC AGC GAC ATT CTC AAC ATT CAG 900 

281 ILDTGTSKLVGPSSDILNIQ 300 

9 01 CAA GCT ATT GGA GCC ACA CAG AAC CAG TAC GGT GAG TTT GAC ATA GAT TGC GAC AAC CTT 9 60 

301 QAIGATQNQYGEFDIDCDNL 320 

9 61 AGC TAC ATG CCT ACA GTT GTC TTT GAG ATC AAC GGC AAG ATG TAC CCA CTG ACC CCC TCC 102 0 

321 SYMPTVVFEINGKMY PLTPS 340 
1021 GCC TAT ACC AGC CAG GAT CAA GGG TTC TGC ACC AGT GGA TTC CAG AGT GAG AAC CAT TCC 108 0 
341 AYTSQDQGFCTSGFQSENHS 360 
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FIGURE 1 cont'd 

10 81 CAG AAA TGG ATC TTG GGA GAT GTG TTC ATT CGT GAG TAC TAC AGC GTC TTT GAC AGG GCC 114 0 
361 QKWILGDVFIREYYSVFDRA 380 

1141 AAC AAC CTC GTT GGG CTA GCT AAA GCA ATC TGA 1200 
381 NNLVGLAKAI * 391 
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FIGURE 2 

1 ctgcaggaattcattgtactcccagtatcattatagtgaaagttttggctctctcgccggtggttttttacctctattta 80 
Bl aaggggttttccacctaaaaattctggtatcattctcactttacttgttactttaatttctcataatctttggttgaaat 160 
161 tatcacgcttccgcacacgatatccctacaaat ttattatttgttaaacattttcaaaccgcataaaattttatgaagtc 24 0 
241 ccgtctatctttaatgtagtctaacattttcatattgaaatatataatttacttaattttagcgttggtagaaagcataa 3 20 

3 21 agatttattcttattcttcttcatataaatgtttaatatacaatataaacaaattctttaccttaagaaggatttcccat 400 
401 tttatattctaaaaatatatttatcaaatatttttcaaccacgtaaatctcataataataagttgtttcaaaagtaataa 4 80 

4 81 aatttaactccataatttttttattcgactgatcttaaagcaacacccagtgacacaactagccatttttttctttgdat 560 
561 aaaaaaatccaattatcattgtattttttttatacaatgaaaatttcaccaaacaatcatttgtggtatttctgaagcaa 64 0 
641 gtcatgttatgcaaaattctataattcccatttgacactacggaagtaactgaagatctgcttttacatgcgagacacac 72 0 
721 cttctaaagtaattttaataatagttactatattcaagatttcatatatcaaatactcaatattacttctaaaaaattaa 800 
8 01 ttagatataattaaaatattacttttttaattttaagtttaattgttgaatttgtgactattgatttattattctactat 88 0 

8 81 gtttaaattgttttatagatagtttaaagtaaatataagtaatgtagtagagtgttagagtgttaccctaaaccataaac 96 0 

9 61 tataacatttatggtggactaattttcatatatttcttattgcttttaccttttcttggtatgtaagtccgtaactagaa 104 0 
1041 ttacagtgggttgccabggcactctgtggtcttttggttcatgcatgggtcttgcgcaagaaaaagacaaagaacaaaga 112 0 
1121 aaaaagacaaaacagagagacaaaacgcaatcacacaaccaactcaaattagtcactggctgatcaagatcgccgcgtcc 1200 
1201 atgtatgtctaaatgccatgcaaagcaacacgtgcttaacatgcactttaaatggctcacccatctcaacccacacacaa 12 80 

12 81 acacattgcctttttcttcatcatcaccacaaccacctgtatatattcattctcttccgccacctcaatttcttcacttc 13 60 

13 61 aacacacgtcaacctgcatatgcgtgtcatcccatgcccaaatctccatgcatgttccaaccaccttctctcttatataa 14 40 
1441 tacctataaatacctctaatatcactcacttctttcatcatccatccatccagagtactactactctactactataatac 15 20 

1521 cccaacccaactcatattcaatactactctact ATG AAC TTC CTT AAG TCT TTC CCT TTC TAC GCT 1586 
1 MNFLKSFPFYA 11 

1587 TTC CTT TGT TTC GGT CAA TAC TTC GTT GCT GTT ACT CAC GCT GCT GAG ATC ACC CGC ATT 164 6 
\2 F L C F G Q Y F V A V T H A A E I TRI 31 

164 7 CCT CTC TAC AAA GGT AAG TCT CTC CGT AAG GCG CTG AAG GAA CAT GGA CTT CTA GAA GAC 1706 
32PLYKGKSLRKALKEHGLLED 51 

17 07 TTC TTG CAG AAA CAA CAG TAT GGC ATC AGC AGC AAG TAC TCC GGC TTC GGT GAA GTT GCT 17 6 6 
52 FLQKQQYGI SSKYSGFGEVA 71 
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17 67 AGC GTG CCA CTT ACC AAC TAC CTT GAT AGT CAA TAC TTT GGG AAG ATC TAC CTC GGA ACC 182 6 
72SV.PLTNYLDSQYFGKIYLGT 91 

18 27 CCG CCT CAA GAG TTC ACC GTT CTC TTT GAT ACT GGT TCC TCT GAC TTC TGG GTT CCC TCT 1886 
92 PP QEFTV LFDTGS S DFWVPS 111 

18 87 ATC TAC TGC AAG AGC AAT GCC TGC AAG AAC CAC CAA AGA TTC GAT CCG AGA AAG TCG TCC 19 46 
112 IYCKSNACKNHQRFDPRKSS 131 

19 47 ACC TTC CAG AAC TTA GGC AAA CCC TTG TCT ATA CAC TAC GGT ACA GGT AGC ATG CAA GGA 20 06 
132 TFQNLGKPLSIHYGTGSMQG 151 

2 007 ATC TTA GGC TAT GAT ACC GTC ACT GTC TCC AAC ATT GTG GAC ATT CAA CAG ACA GTA GGA 2 066 

152 ILGYDTVTVSNIVDIQQTVG 171 

2067 CTT AGC ACC CAA GAA CCA GGT GAT GTC TTC ACC TAT GCA GAA TTC GAT GGC ATC CTT GGT 212 6 

172 LSTQEPGDVFTYAEFDGILG 191 

2127 ATG GCA TAC CCA TCG CTC GCG TCA GAG TAC TCG ATA CCT GTG TTT GAC AAC ATG ATG AAC 218 6 

192 MAYPSLAS EYSIPVFDNMMN 211 

2187 CGA CAC CTA GTA GCT CAA GAC TTG TTC TCG GTT TAC ATG GAC AGG AAT GGC CAG GAG AGC 224 6 

212 RHLVAQDLFSVYMDRNGQES 231 

2 2 47 ATG CTC ACG CTT GGA GCT ATT GAT CCA TCC TAC TAC ACA GGA TCT CTT CAC TGG GTT CCA 23 06 

232 MLTLGAIDPSYYTGS LHWVP 251 

2 3 07 GTC ACT GTG CAG CAG TAC TGG CAA TTC ACT GTG GAC AGT GTC ACC ATC AGC GGT GTG GTT 23 6 6 

252 VTVQQYWQFTVDSVTI SGVV 271 

23 67 GTT GCA TGT GAA GGT GGA TGT CAA GCT ATC TTG GAT ACC GGT ACG TCC AAG CTG GTC GGA 242 6 

272 VACEGGCQAILDTGTSKLVG 291 

2 4 27 CCT AGC AGC GAC ATT CTC AAC ATT CAG CAA GCT ATT GGA GCC ACA CAG AAC CAG TAC GGT 248 6 

292 PS SDI LNI QQAIGATQNQYG 311 

2 4 87 GAG TTT GAC ATA GAT TGC GAC AAC CTT AGC TAC ATG CCT ACA GTT GTC TTT GAG ATC AAC 254 6 

312 EFDI DC DNLSYMPTVVFEI N 331 

254 7 GGC AAG ATG TAC CCA CTG ACC CCC TCC GCC TAT ACC AGC CAG GAT CAA GGG TTC TGC ACC 2 60 6 

332 GKMYPLTPSAYTSQDQGFCT 351 

2 607 AGT GGA TTC CAG AGT GAG AAC CAT TCC CAG AAA TGG ATC TTG GGA GAT GTG TTC ATT CGT 2 666 

352 SGFQSENHSQKWILGDVFI R 371 

2 667 GAG TAC TAC AGC GTC TTT GAC AGG GCC AAC AAC CTC GTT GGG CTA GCT AAA GCA ATC TGA 2 72 6 

372 E Y Y S V F D R A N N L V G L A K A I * 3 91 

27 27 agcttaataagtatgaactaaaatgcatgtaggtgtaagagctcatqgagagcatggaatattgtatccgaccatgtaac 2 80 6 
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2 807 agtataataactgagctccatctcacttcttctatgaataaacaaaggatgttatgacatattaacactctatctatgca 2886 

2 8 87 ccttattgttctatgataaatttcctcttattattataaatcatctgaatcgtgacggcttatggaatgcttcaaatagt 2 96 6 

2 967 acaaaaacaaatgtgtactataagac tttctaaacaat tctaactt tagcattgtgaacgagacataagtgt taagaaga 3 04 6 

3 04 7 cataacaattataatggaagaagtttgtctccatttatatattatatattacccacttacgtattatattaggatgttaa 312 6 
312 7 ggagacataacaattataaagagagaagtttgtatccatttatatattatatactacccatttatatattatacttac.ee 3 2 0 6 
3 2 07 acttatttaatgtctttataaggtttgatccatgatatttctaatattttagttgatatgtatatgaaagggtactattt 32 86 
3 287 gaactctcttactctgtataaaggttggatcatccttaaagtgggtc tatttaattttattgc ttcttacagataaaaaa 33 66 
3 3 67 aaaa ttatgagttggt t tgataaaatat tgaaggatttaaaataataataaataataaataacata taatatatgta tat 3446 
3 44 7 aaatttattataatataacatttatctataaaaaagtaaatat tgteataaatctatacaa tcgtttagccttgctggac 352 6 
3 52 7 gactc tcaattatttaaacgagagtaaacatatttgactttt tggttatttaacaaattat ta tttaacactatatgaaa 3 606 
3 607 tttttt ttt tttatcggcaaggaaataaaattaaattaggagggacaatggtgtgtcccaatccttatacaaccaact tc 3 686 
3 687 cacaggaaggtcaggtcggggacaacaaaaaaacaggcaagggaaat ttt ttaatttgggt tgtcttgtttgctgcataa 37 66 
3767 tttatgcagtaaaacactacacataacccttttagcagtagagcaatggttgaccgtgtgc t tagcttcttttattt tat 3846 
3 847 ttttttatcagcaaagaataaataaaataaaatgagacacttcagggatgtttcaacccttatacaaaaccccaaaaaca 3 926 
3927 agtttcctagcaccctaccaactaaggtacc 3957 
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SEQUENCE LISTING 



<110> van Rooi j en, Gi j s 

Keon, Richard Glenn 
Boothe , Joseph 
Shen, Yin 

SemBioSys Genetics Inc. 

<120> Commercial Production of Chymosin in Plants 

<130> 9369-152 

<140> 
<141> 

<160> 4 

<170> Patentln Ver . 2.0 

<210> 1 
<211> 1173 
<212> DNA 
<213> Bovine 

<220> 

<221> CDS 

<222> (1) . . (1173) 

<400> 1 

atg aac ttc ctt aag tct ttc cct ttc tac get ttc ctt tgt ttc ggt 48 

Met Asn Phe Leu Lys Ser Phe Pro Phe Tyr Ala Phe Leu Cys Phe Gly 
15 10 15 

caa tac ttc gtt get gtt act cac get get gag ate acc cgc att cct 96 
Gin Tyr Phe Val Ala Val Thr His Ala Ala Glu lie Thr Arg lie Pro 

20 25 30 

etc tac aaa ggt aag tct etc cgt aag gcg ctg aag gaa cat gga ctt 144 
Leu Tyr Lys Gly Lys Ser Leu Arg Lys Ala Leu Lys Glu His Gly Leu 
35 40 45 

eta gaa gac ttc ttg cag aaa caa cag tat ggc ate age age aag tac 192 
Leu Glu Asp Phe Leu Gin Lys Gin Gin Tyr Gly He Ser Ser Lys Tyr 
50 55 60 

tec ggc ttc ggt gaa gtt get age gtg cca ctt acc aac tac ctt gat 240 
Ser Gly Phe Gly Glu Val Ala Ser Val Pro Leu Thr Asn Tyr Leu Asp 
65 70 75 80 

agt caa tac ttt ggg aag ate tac etc gga acc ccg cct caa gag ttc 288 
Ser Gin Tyr Phe Gly Lys He Tyr Leu Gly Thr Pro Pro Gin Glu Phe 

85 90 95 

acc gtt etc ttt gat act ggt tec tct gac ttc tgg gtt ccc tct ate 336 
Thr Val Leu Phe Asp Thr Gly Ser Ser Asp Phe Trp Val Pro Ser He 

100 ' 105 ~ 110 

tac tgc aag age aat gec tgc aag aac cac caa aga ttc gat ccg aga 3 84 
Tyr Cys Lys Ser Asn Ala Cys Lys Asn His Gin Arg Phe Asp Pro Arg 
115 120 125 

aag teg tec acc ttc cag aac tta ggc aaa ccc ttg tct ata cac tac 432 
Lys Ser Ser Thr Phe Gin Asn Leu Gly Lys Pro Leu Ser He His Tyr 
130 135 140 
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ggt aca ggt age atg caa gga ate tta ggc tat gat acc gtc act gtc 480 
Gly Thr Gly Ser Met Gin Gly lie Leu Gly Tyr Asp Thr Val Thr Val 
145 150 155 160 

tec aac att gtg gac att caa cag aca gta gga ctt age acc caa gaa 528 
Ser Asn lie Val Asp lie Gin Gin Thr Val Gly Leu Ser Thr Gin Glu 

165 170 175 

cca ggt gat gtc ttc acc tat gca gaa ttc gat ggc ate ctt ggt atg 576 
Pro Gly Asp Val Phe Thr Tyr Ala Glu Phe Asp Gly lie Leu Gly Met 

180 185 190 

gca tac cca teg etc gcg tea gag tac teg ata cct gtg ttt gac aac 624 
Ala Tyr Pro Ser Leu Ala Ser Glu Tyr Ser lie Pro Val Phe Asp Asn 
195 200 205 

atg atg aac cga cac eta gta get caa gac ttg ttc teg gtt tac atg 672 
Met Met Asn Arg His Lau Val Ala Gin Asp Leu Phe Ser Val Tyr Met 
210 215 220 

gac agg aat ggc cag gag age atg etc acg ctt gga get att gat cca 720 
Asp Arg Asn Gly Gin Glu Ser Met Leu Thr Leu Gly Ala lie Asp Pro 
225 230 235 240 

tec tac tac aca gga tct ctt cac tgg gtt cca gtc act gtg cag cag 768 
Ser Tyr Tyr Thr Gly Ser Leu His Trp Val Pro Val Thr Val Gin Gin 

245 250 255 

tac tgg caa ttc act gtg gac agt gtc acc ate age ggt gtg gtt gtt 816 
Tyr Trp Gin Phe Thr Val Asp Ser Val Thr lie Ser Gly Val Val Val 

260 265 270 

gca tgt gaa ggt gga tgt caa get ate ttg gat acc ggt acg tec aag 864 
Ala Cys Glu Gly Gly Cys Gin Ala lie Leu Asp Thr Gly Thr Ser Lys 
275 ~ 280 285 

ctg gtc gga cct age age gac att etc aac att cag caa get att gga 912 
Leu Val Gly Pro Ser Ser Asp lie Leu Asn lie Gin Gin Ala lie Gly 
290 295 300 

gee aca cag aac cag tac ggt gag ttt gac ata gat tgc gac aac ctt 960 
Ala Thr Gin Asn Gin Tyr Gly Glu Phe Asp lie Asp Cys Asp Asn Leu 
305 310 315 320 

age tac atg cct aca gtt gtc ttt gag ate aac ggc aag atg tac cca 1008 
Ser Tyr Met Pro Thr Val Val Phe Glu He Asn Gly Lys Met Tyr Pro 

325 330 335 

ctg acc ccc tec gee tat acc age cag gat caa ggg ttc tgc acc agt 1056 
Leu Thr Pro Ser Ala Tyr Thr Ser Gin Asp Gin Gly Phe Cys Thr Ser 

340 345 350 

gga ttc cag agt gag aac cat tec cag aaa tgg ate ttg gga gat gtg 1104 
Gly Phe Gin Ser Glu Asn His Ser Gin Lys Trp He Leu Gly Asp Val 
355 360 365 

ttc att cgt gag tac tac age gtc ttt gac agg gee aac aac etc gtt 1152 
Phe He Arg Glu Tyr Tyr Ser Val Phe Asp Arg Ala Asn Asn Leu Val 
370 375 380 

ggg eta get aaa gca ate tga 1173 
Gly Leu Ala Lys Ala He 
385 390 
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<210> 2 
<211> 390 
<212> PRT 
<213> Bovine 

<400> 2 

Met Asn Phe Leu Lys Ser Phe Pro Phe Tyr Ala Phe Leu Cys Phe Gly 
15 10 15 

Gin Tyr Phe Val Ala Val Thr His Ala Ala Glu lie Thr Arg lie Pro 

20 25 30 

Leu Tyr Lys Gly Lys Ser Leu Arg Lys Ala Leu Lys Glu His Gly Leu 
35 40 45 

Leu Glu Asp Phe Leu Gin Lys Gin Gin Tyr Gly lie Ser Ser Lys Tyr 
50 " 55 60 

Ser Gly Phe Gly Glu Val Ala Ser Val Pro Leu Thr Asn Tyr Leu Asp 
65 70 75 80 

Ser Gin Tyr Phe Gly Lys lie Tyr Leu Gly Thr Pro Pro Gin Glu Phe 

85 90 95 

Thr Val Leu Phe Asp Thr Gly Ser Ser Asp Phe Trp Val Pro Ser lie 

100 105 110 

Tyr Cys Lys Ser Asn Ala Cys Lys Asn His Gin Arg Phe Asp Pro Arg 
115 120 125 

Lys Ser Ser Thr Phe Gin Asn Leu Gly Lys Pro Leu Ser lie His Tyr 
130 135 140 

Gly Thr Gly Ser Met Gin Gly lie Leu Gly Tyr Asp Thr Val Thr Val 
145 150 155 160 

Ser Asn lie Val Asp lie Gin Gin Thr Val Gly Leu Ser Thr Gin Glu 

165 170 175 

Pro Gly Asp Val Phe Thr Tyr Ala Glu Phe Asp Gly lie Leu Gly Met 

180 185 190 

Ala Tyr Pro Ser Leu Ala Ser Glu Tyr Ser lie Pro Val Phe Asp Asn 
195 200 205 

Met Met Asn Arg His Leu Val Ala Gin Asp Leu Phe Ser Val Tyr Met 
210 215 220 

Asp Arg Asn Gly Gin Glu Ser Met Leu Thr Leu Gly Ala lie Asp Pro 
225 230 235 240 

Ser Tyr Tyr Thr Gly Ser Leu His Trp Val Pro Val Thr Val Gin Gin 

245 250 255 

Tyr Trp Gin Phe Thr Val Asp Ser Val Thr lie Ser Gly Val Val Val 

260 265 270 

Ala Cys Glu Gly Gly Cys Gin Ala lie Leu Asp Thr Gly Thr Ser Lys 
275 280 285 

Leu Val Gly Pro Ser Ser Asp lie Leu Asn lie Gin Gin Ala lie Gly 
290 295 300 

Ala Thr Gin Asn Gin Tyr Gly Glu Phe Asp lie Asp Cys Asp Asn Leu 
305 310 315 320 
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Ser Tyr Met Pro Thr Val Val Phe Glu lie Asn Gly Lys Met Tyr Pro 

325 330 335 

Leu Thr Pro Ser Ala Tyr Thr Ser Gin Asp Gin Gly Phe Cys Thir Ser 

340 345 350 

Gly Phe Gin Ser Glu Asn His Ser Gin Lys Trp lie Leu Gly Asp Val 
355 360 365 

Phe lie Arg Glu Tyr Tyr Ser Val Phe Asp Arg Ala Asn Asn Leu Val 
370 375 380 

Gly Leu Ala Lys Ala lie 
385 390 

<210> 3 

<211> 3957 

<212> DNA 

<213> Artificial Sequence 

<220> 
<221> CDS 

<222> (1554) . . (2726) 
<220> 

<223> Description of Artificial Sequence: Figure 2 
<400> 3 

ctgcaggaat tcattgtact cccagtatca ttatagtgaa agttttggct ctctcgccgg 60 
tggtttttta cctctattta aaggggtttt ccacctaaaa attctggtat cattctcact 120 
ttacttgtta ctttaatttc tcataatctt tggttgaaat tatcacgctt ccgcacacga 180 
tatccctaca aatttattat ttgttaaaca ttttcaaacc gcataaaatt ttatgaagtc 240 
ccgtctatct ttaatgtagt ctaacatttt catattgaaa tatataattt acttaatttt 300 
agcgttggta gaaagcataa agatttattc ttattcttct tcatataaat gtttaatata 360 
caatataaac aaattcttta ccttaagaag gatttcccat tttatatttt aaaaatatat 420 
ttatcaaata tttttcaacc acgtaaatct cataataata agttgtttca aaagtaataa 480 
aatttaactc cataattttt ttattcgact gatcttaaag caacacccag tgacacaact 540 
agccattttt ttctttgaat aaaaaaatcc aattatcatt gtattttttt tatacaatga 600 
aaatttcacc aaacaatcat ttgtggtatt tctgaagcaa gtcatgttat gcaaaattct 660 
ataattccca tttgacacta cggaagtaac tgaagatctg cttttacatg cgagacacat 720 
cttctaaagt aattttaata atagttacta tattcaagat ttcatatatc aaatactcaa 780 
tattacttct aaaaaattaa ttagatataa ttaaaatatt acttttttaa ttttaagttt 840 
aattgttgaa tttgtgacta ttgatttatt attctactat gtttaaattg ttttatagat 900 
agtttaaagt aaatataagt aatgtagtag agtgttagag tgttacccta aaccataaac 960 
tataacattt atggtggact aattttcata tatttcttat tgcttttacc ttttcttggt 1020 
atgtaagtcc gtaactagaa ttacagtggg ttgccatggc actctgtggt cttttggttc 1080 
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atgcatgggt 


cttgcgcaag 


aaaaagacaa 


agaacaaaga 


aaaaagacaa 


aacagagaga 


1140 


caaaacgcaa 


tcacacaacc 


aactcaaatt 


agtcactggc 


tgatcaagat 


cgccgcgtcc 


1200 


atgtatgtct 


aaatgccatg 


caaagcaaca 


cgtgcttaac 


atgcacttta 


aatggctcac 


1260 


ccatctcaac 


ccacacacaa 


acacattgcc 


tttttcttca 


tcatcaccac 


aaccacctgt 


1320 


atatattcat 


tctcttccgc 


cacc tcaatt 


tcttcacttc 


aacacacgtc 


aacctgcata 


1380 


tgcgtgtcat 


cccatgccca 


aatctccatg 


catgttccaa 


ccaccttctc 


tcttatataa 


1440 


tacctataaa 


tacctctaat 


atcactcact 


tctttcatca 


tccatccatc 


cagagtacta 


1500 


ctactctact 


actataatac 


cccaacccaa 


c tcatattca 


atactactc t 


act atg 


1556 



Met 
1 

aac ttc ctt aag tct ttc cct ttc tac get ttc ctt tgt ttc ggt caa 1604 
Asn Phe Leu Lys Ser Phe Pro Phe Tyr Ala Phe Leu Cys Phe Gly Gin 

5 10 15 



tac ttc gtt get gtt act cac get get gag ate acc cgc att cct etc 
Tyr Phe Val Ala Val Thr Kis Ala Ala Glu lie Thr Arg lie Pro Leu 
20 25 30 



gaa gac ttc ttg cag aaa caa cag tat ggc ate age age aag tac tec 
Glu Asp Phe Leu Gin Lys Gin Gin Tyr Gly lie Ser Ser Lys Tyr Ser 
50 55 60 65 

ggc ttc ggt gaa gtt get age gtg cca ctt acc aac tac ctt gat agt 
Gly Phe Gly Glu Val Ala Ser Val Pro Leu Thr Asn Tyr Leu Asp Ser 

70 75 80 

caa tac ttt ggg aag ate tac etc gga acc ccg cct caa gag ttc acc 
Gin Tyr Phe Gly Lys lie Tyr Leu Gly Thr Pro Pro Gin Glu Phe Thr 

85 90 95 

gtt etc ttt gat act ggt tec tct gac ttc tgg gtt ccc tct ate tac 
Val Leu Phe Asp Thr Gly Ser Ser Asp Phe Trp Val Pro Ser lie Tyr 
100 " 105 110 

tgc aag age aat gee tgc aag aac cac caa aga ttc gat ccg aga aag 
Cys Lys Ser Asn Ala Cys Lys Asn His Gin Arg Phe Asp Pro Arg Lys 
115 120 125 

teg tec acc ttc cag aac tta ggc aaa ccc ttg tct ata cac tac ggt 
Ser Ser Thr Phe Gin Asn Leu Gly Lys Pro Leu Ser lie His Tyr Gly 
130 135 140 145 

aca ggt age atg caa gga ate tta ggc tat gat acc gtc act gtc tec 
Thr Gly Ser Met Gin Gly He Leu Gly Tyr Asp Thr Val Thr Val Ser 

150 155 160 

aac att gtg gac att caa cag aca gta gga ctt age acc caa gaa cca 
Asn He Val Asp lie Gin Gin Thr Val Gly Leu Ser Thr Gin Glu Pro 

165 170 175 

ggt gat gtc ttc acc tat gca gaa ttc gat ggc ate ctt ggt atg gca 
Gly Asp Val Phe Thr Tyr Ala Glu Phe Asp Gly He Leu Gly Met Ala 
180 185 190 



1652 



tac aaa ggt aag tct etc cgt aag gcg ctg aag gaa cat gga ctt eta 1700 
Tyr Lys Gly Lys Ser Leu Arg Lys Ala Leu Lys Glu His Gly Leu Leu 
35 40 45 



1748 



1796 



1844 



1892 



1940 



1988 



2036 



2084 



2132 
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tac cca teg etc gcg tea gag tac teg ata cct gtg ttt gac aac atg 2180 
Tyr Pro Ser Leu Ala Ser Glu Tyr Ser He Pro Val Phe Asp Asn Met 
195 200 205 

atg aac cga cac eta gta get caa gac ttg ttc teg gtt tac atg gac 2228 
Met Asn Arg His Leu Val Ala Gin Asp Leu Phe Ser Val Tyr Met Asp 
210 215 220 225 

agg aat ggc cag gag age atg etc acg ctt gga get att gat cca tec 2276 
Arg Asn Gly Gin Glu Ser Met Leu Thr Leu Gly Ala He Asp Pro Ser 

230 235 240 

tac tac aca gga tct ctt cac tgg gtt cca gtc act gtg cag cag tac 2324 
Tyr Tyr Thr Gly Ser Leu His Trp Val Pro Val Thr Val Gin Gin Tyr 

245 250 255 

tgg caa ttc act gtg gac agt gtc ace ate age ggt gtg gtt gtt gca 2372 
Trp Gin Phe Thr Val Asp Ser Val Thr lie Ser Gly Val Val Val Ala 
260 265 270 

tgt gaa ggt gga tgt caa get ate ttg gat ace ggt acg tec aag ctg 2420 
Cys Glu Gly Gly Cys Gin Ala He Leu Asp Thr Gly Thr Ser Lys Leu 
275 280 285 

gtc gga cct age age gac att etc aac att cag caa get att gga gee 2468 
Val Gly Pro Ser Ser Asp lie Leu Asn lie Gin Gin Ala He Gly Ala 
290 295 300 305 

aca cag aac cag tac ggt gag ttt gac ata gat tgc gac aac ctt age 2516 
Thr Gin Asn Gin Tyr Gly Glu Phe Asp He Asp Cys Asp Asn Leu Ser 

310 315 " 320 

tac atg cct aca gtt gtc ttt gag ate aac ggc aag atg tac cca ctg 2564 
Tyr Met Pro Thr Val Val Phe Glu He Asn Gly Lys Met Tyr Pro Leu 

325 330 335 

ace ccc tec gec tat acc age cag gat caa ggg ttc tgc acc agt gga 2612 
Thr Pro Ser Ala Tyr Thr Ser Gin Asp Gin Gly Phe Cys Thr Ser Gly 
340 345 350 

ttc cag agt gag aac cat tec cag aaa tgg ate ttg gga gat gtg ttc 2660 
Phe Gin Ser Glu Asn His Ser Gin Lys Trp He Leu Gly Asp Val Phe 
355 360 365 

att cgt gag tac tac age gtc ttt gac agg gee aac aac etc gtt ggg 2708 
He Arg Glu Tyr Tyr Ser Val Phe Asp Arg Ala Asn Asn Leu Val Gly 
370 375 380 385 

eta get aaa gca ate tga agcttaataa gtatgaacta aaatgcatgt 2756 
Leu Ala Lys Ala He 

390 

aggtgtaaga gctcatggag agcatggaat attgtatccg accatgtaac agtataataa 2816 

ctgagctcca tctcacttct tctatgaata aacaaaggat gttatgatat attaacactc 2876 

tatctatgea ccttattgtt ctatgataaa tttcctctta ttattataaa tcatctgaat 2936 

cgtgacggct tatggaatgc ttcaaatagt acaaaaacaa atgtgtacta taagactttc 2996 

taaacaattc taactttagc attgtgaacg agacataagt gttaagaaga cataacaatt 3056 

ataatggaag aagtttgtct ccatttatat attatatatt acccacttat gtattatatt 3116 

aggatgttaa ggagacataa caattataaa gagagaagtt tgtatccatt tatatattat 3176 
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atactaccca 


tttatatatt 


atacttatcc 


acttatt taa 


tgtctttata 


aggtttgatc 


3236 


catgatattt 


ctaatatttt 


agttgatatg 


tatatgaaag 


ggtactattt 


gaactctctt 


3296 


actctgtata 


aaggttggat 


catccttaaa 


gtgggtctat 


ttaattttat 


tgettcttae 


3356 


agataaaaaa 


aaaattatga 


gttggtttga 


taaaatattg 


aaggatttaa 


aataataata 


3416 


aataataaat 


aacatataat 


atatgtatat 


aaatttatta 


taatataaca 


tttatctata 


3476 


aaaaagtaaa 


tattgtcata 


aatctataca 


a t cert 1 1 acre 


r* t - f* ere* \~ cinz* r» 


yaLLL LCda L 


J J JD 


tatttaaacg 


agagtaaaca 


tatttgactt 


tttggttatt 


taacaaatta 


ttatttaaca 


3596 


ctatatgaaa 


tttttttttt 


ttatcggcaa 


ggaaataaaa 


ttaaattagg 


agggacaatg 


3656 


gtgtgtccca 


atccttatac 


aaccaacttc 


cacaggaagg 


teaggteggg 


gacaacaaaa 


3716 


aaacaggcaa 


gggaaatttt 


ttaatttggg 


ttgtcttgtt 


tgctgcataa 


tttatgcagt 


3776 


aaaacactac 


acataaccct 


tttagcagta 


gagcaatggt 


tgaccgtgtg 


cttagcttct 


3836 


tttattttat 


ttttttatca 


gcaaagaata 


aataaaataa 


aatgagacac 


ttcagggatg 


3896 


tttcaaccct 


tatacaaaac 


cccaaaaaca 


agtttcctag 


caccctacca 


actaaggtac 


3956 



3957 



<210> 4 
<211> 390 
<212> PRT 

<213> Artificial Sequence 
<400> 4 

Met Asn Phe Leu Lys Ser Phe 
1 5 

Gin Tyr Phe Val Ala Val Thr 

20 

Leu Tyr Lys Gly Lys Ser Leu 
35 

Leu Glu Asp Phe Leu Gin Lys 

50 55 

Ser Gly Phe Gly Glu Val Ala 
65 70 

Ser Gin Tyr Phe Gly Lys He 

85 

Thr Val Leu Phe Asp Thr Gly 

100 

Tyr Cys Lys Ser Asn Ala Cys 
115 

Lys Ser Ser Thr Phe Gin Asn 
130 135 

Gly Thr Gly Ser Met Gin Gly 
145 150 



Pro Phe Tyr Ala 
10 

His Ala Ala Glu 
25 

Arg Lys Ala Leu 
40 

Gin Gin Tyr Gly 



Ser Val Pro Leu 

75 

Tyr Leu Gly Thr 
90 



Ser Ser Asp Phe 
105 

Lys Asn His Gin 
120 

Leu Gly Lys Pro 



He Leu Gly Tyr 

155 



Phe Leu Cys Phe Gly 

15 

He Thr Arg He Pro 
30 

Lys Glu His Gly Leu 
45 

He Ser Ser Lys Tyr 
60 

Thr Asn Tyr Leu Asp 

80 

Pro Pro Gin Glu Phe 

95 

Trp Val Pro Ser He 
110 

Arg Phe Asp Pro Arg 
125 

Leu Ser He His Tyr 
140 

Asp Thr Val Thr Val 

160 
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Ser Asn lie Val 



Pro Gly Asp Val 

180 

Ala Tyr Pro Ser 
195 

Met Met Asn Arg 
210 

Asp Arg Asn Gly 
225 



Ser Tyr Tyr Thr 



Tyr Trp Gin Phe 

260 

Ala Cys Glu Gly 
275 

Leu Val Gly Pro 
290 

Ala Thr Gin Asn 
305 



Ser Tyr Met Pro 



Leu Thr Pro Ser 

340 

Gly Phe Gin Ser 
355 

Phe lie Arg Glu 
370 

Gly Leu Ala Lys 
385 



Asp lie Gin Gin 
165 

Phe Thr Tyr Ala 



Leu Ala Ser Glu 

200 

His Leu Val Ala 
215 

Gin Glu Ser Met 
230 

Gly Ser Leu His 
245 

Thr Val Asp Ser 



Gly Cys Gin Ala 

280 

Ser Ser Asp lie 
295 

Gin Tyr Gly Glu 
310 

Thr Val Val Phe 
325 

Ala Tyr Thr Ser 



Glu Asn His Ser 

360 

Tyr Tyr Ser Val 
375 

Ala He 
390 
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Thr Val Gly Leu 
170 

Glu Phe Asp Gly 
185 

Tyr Ser He Pro 



Gin Asp Leu Phe 

220 

Leu Thr Leu Gly 
235 

Trp Val Pro Val 
250 

Val Thr He Ser 
265 

lie Leu Asp Thr 



Leu Asn He Gin 

300 

Phe Asp He Asp 
315 

Glu lie Asn Gly 
330 

Gin Asp Gin Gly 
345 

Gin Lys Trp He 



Phe Asp Arg Ala 

380 



Ser Thr Gin Glu 
175 

He Leu Gly Met 

190 

Val Phe Asp Asn 
205 

Ser Val Tyr Met 



Ala He Asp Pro 

240 

Thr Val Gin Gin 
255 

Gly Val Val Val 
270 

Gly Thr Ser Lys 
285 

Gin Ala He Gly 



Cys Asp Asn Leu 

320 

Lys Me t Tyr Pro 
335 

Phe Cys Thr Ser 
350 

Leu Gly Asp Val 
365 



Asn Asn Leu Val 
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